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We present a theoretical study of minimum error probability discrimination, using quantum- 
optical probe states, of M optical phase shifts situated symmetrically on the unit circle. We assume 
ideal lossless conditions and full freedom for implementing quantum measurements and for probe 
state selection, subject only to a constraint on the average energy, i.e., photon number. In particular, 
the probe state is allowed to have any number of signal and ancillary modes, and to be pure or 
mixed. Our results are based on a simple criterion that partitions the set of pure probe states into 
equivalence classes with the same error probability performance. Under an energy constraint, we 
find the explicit form of the state that minimizes the error probability. This state is an unentangled 
but nonclassical single-mode state. The error performance of the optimal state is compared with 
several standard states in quantum optics. We also show that discrimination with zero error is 
possible only beyond a threshold energy of (M — l)/2. For the M = 2 case, we show that the 
optimum performance is readily demonstrable with current technology. While transmission loss and 
detector inefficiencies lead to a nonzero erasure probability, the error rate conditional on no erasure 
is shown to remain the same as the optimal lossless error rate. 



I. INTRODUCTION 

The estimation of an optical phase shift using quantum 
states of light is a well-known theme of both theoretical 
and experimental studies, and is still an active area of 
research, as seen from a sample [T] of recent work. A 
less well-known but analogous sensing problem is that 
of discriminating a finite number M > 2 of phase shifts 
symmetrically arranged on the unit circle, which may be 
thought of as a discrete version of the phase estimation 
problem. The problems differ in the criterion used to 
measure performance - in the estimation problem, a typ- 
ical figure of merit is the mean squared error while an 
error probability criterion is natural for the discrimina- 
tion problem. 

The phase discrimination problem may be viewed in 
communication terms in analogy with M-ary phase shift 
keying (PSK) in ordinary and optical communications 
[21 In M-ary PSK, one of M uniformly spaced phase 
shifts is applied to a predetermined waveform for the pur- 
pose of communicating one of M messages (or, equiva- 
lently, log 2 M bits of data) from the sender to the re- 
ceiver. Binary PSK (BPSK) using coherent states of 
light, with error probabilities near the quantum limit in 
the absence of noise, is already a rather mature technol- 
ogy [J]. As the demand for high-speed communication 
increases, M-ary PSK with M > 2 is becoming attractive 
in optical communication, despite the increased system 
complexity, because it provides an increase in the num- 
ber of bits per transmitted symbol without an increase 
in the frequency bandwidth required [3J . In the quantum 
version of PSK (see, e.g., the studies [MS]) that is the 
subject of this paper, an optical mode prepared in a pre- 



determined quantum state is the analog of the classical 
signal waveform to which the information-bearing phase 
shifts arc applied. 

Another application of our study is to the recently de- 
veloped concept of quantum reading of a classical digital 
memory [5lU4). The original proposal of Ref. [5] con- 
sidered the use of a quantum-optical probe state that 
reads a standard optically encoded digital memory such 
as a CD or DVD with a bit error probability better than 
that achievable with standard laser sources. In a CD or 
DVD, information is stored by varying the properties of 
the recording surface in a rather involved manner - see, 
e.g., Ref. [TS]. However, the overall reading process may 
be modeled as the discrimination of two beam-splitter 
channels with transmittance depending on the data bit, 
i.e., the model involves an amplitude (rather than phase) 
encoding. In Ref. [10] . a general problem of discriminat- 
ing two beam-splitter channels was analyzed that models 
any kind of bit encoding, either in phase or amplitude or 
a combination of the two. 

The quantum reading and beam-splitter discrimina- 
tion problems of Refs. [9] and [TU] include optical loss 
naturally and correspond mathematically to the discrim- 
ination of non- unitary quantum channels. In Ref. [llj . 
a purely phase-encoded memory, which can be lossless 
in principle, was proposed. This memory encodes a bit 
0(1) as a (ir) radian phase shift imparted directly to a 
probe beam upon reflection from the encoding surface. 
As such, it corresponds exactly to the M = 2 case of 
the problem considered in this paper. Some variants 
of quantum reading that correspond to discrimination 
of unitary channels have been proposed and also exper- 
imentally demonstrated |16l 117] , Quantum reading of 
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an amplitude-encoded memory has been studied from an 
information-theoretic perspective in refs. [T2l [T5] . 

Viewed as a problem of quantum decision theory, 
the phase discrimination problem falls under the gen- 
eral rubric of distinguishing a symmetric set of quantum 
states that has been studied extensively [T5H2"5] . The 
optimal quantum measurement for a given probe state 
was obtained in the pioneering works [151-20] . This op- 
timal measurement, called variously as the Square-Root 
Measurement (SRM), Least-Squares Measurement, and 
Pretty Good Measurement, has many interesting proper- 
ties and has also been applied to quantum information 
theory [Hrl23]- 

In this paper, our concern is mainly with the design 
problem of choosing the best (i.e., yielding the least error 
probability) probe state under a given energy constraint. 
This problem has not been addressed in full generality in 
the literature, although related studies of quantum com- 
munication using M-ary PSK exist. Thus, for single- 
mode probe states, the problem of probe state optimiza- 
tion under an energy constraint was introduced and stud- 
ied in ref . [5] , but under a restricted class of allowed mea- 
surements. The possibility of zero-error communication 
was mentioned in Ref. [5] for single-mode states, while in 
ref. it was established that two-mode phase-conjugate 
PSK achieves the same end. The SRM was used to give 
a detailed performance evaluation of M-ary PSK for the 
case of the practically important coherent-state trans- 
mitters in ref. [7j. Very recently, a receiver that achieves 
near-optimal error probability for coherent-state M-PSK 
was proposed [8] and demonstrated for 4-PSK. However, 
a fully general treatment of the Af-ary phase discrimina- 
tion problem is as yet lacking, even in the ideal lossless 
case. In this paper, we present such an analysis and ob- 
tain the best probe state over all quantum states of any 
number of signal and ancillary modes under an average 
energy constraint. 

This paper is organized as follows. In Section II, we 
set up the mathematical model of the symmetric phase 
discrimination problem along with a discussion of the 
physical assumptions involved. Section III elaborates its 
solution as follows. In Sections III.A-III.C, we consider 
the case of a pure-state probe. In Section III. A, we ob- 
tain the optimal state under an energy constraint on the 
modes experiencing the phase shift. In Section III.B, we 
briefly discuss the associated quantum measurement that 
achieves the minimum error probability. In Section III.C, 
we consider the case of a combined energy constraint on 
all the probe-state modes. In Section III.D, the optimal- 
ity proof under both energy constraints is extended to al- 
low for mixed-state probes. In Section III.E, we present 
performance curves for several standard states in quan- 
tum optics alongside those of the optimal state and the 
coherent states. In Section IV, we present an easy im- 
plementation of M = 2 phase discrimination under both 
lossless and lossy conditions. In Section V, we conclude 
by discussing some possible future directions based on 
the present work. 




FIG. 1: Symmetric phase shifts on the unit circle for M — 3. 



II. SYMMETRIC PHASE DISCRIMINATION: 
PROBLEM SETUP 

A. Probe state 

Consider, for an integer M > 2, for 9m '■= 2tt/M, and 
for Z M :— {0, . . . , M — I}, the set {9 m :— iti9m I m G 
Zm} of phase shifts symmetrically disposed on the unit 
circle as in Fig. 1. The application of, say, the m-th such 
phase shift to each of J > 1 quasi-monochromatic opti- 
cal field modes (with annihilation operators {a^ }/ = i) is 
represented by the unitary operator 



U„ 



Jm8 M Ns j) = 



\U&\ me 1 M , (1) 



j a™ is the number operator of the j- 



where N. 

th mode, with 1 < j < J. These modes that undergo 
the phase shift are called signal modes, indicated by the 
subscript 'S' (see Fig. 2). In addition to the J signal 
modes, we also allow, as depicted in Fig. 2, any number 
J' > of ancillary modes. These are called idler modes, 
indicated by the subscript T, and have annihilation op- 
erators {&j ^}j, =1 - The idler modes do not acquire the 
m-dependent phase shift, but allow for the preparation of 
a quantum state that is entangled across the signal and 
idler modes. Such a joint state on the signal and idler 
modes will be called a probe state. We are interested in 
the problem of the choice of probe state that minimizes 
the error probability in determining m when the latter is 
drawn, unknown to the receiver, at random from Zm- 

An arbitrary pure probe state of J + J' modes may be 
written in the multimodc number basis as 



)is =^c k , n |k)/|n) 



(2) 



where |k)/ = |fci)®- • -®\kj>) and \n) s = |ni)®- • -®|7ij) 
are multimode Fock states. For any such probe state, the 
output states for the idler and return ('R') modes are 



\ip m )iR = h ® U m \ip)is, m G Z M) 



(3) 
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FIG. 2: A pure state 



of J signal modes (represented 



by the annihilation operators {Sg }/=i) and J' idler modes 

(represented by the annihilation operators {a^ }y =1 ) is pre- 
pared. The signal modes pass through a phase-shifting el- 
ement that modulates the phase of the incident light via 
one of the unitary transformations U m specified by Eq. |TJ). 
The return modes (represented by the annihilation operators 
{&» }/=i) and idler modes, the latter remaining unaffected by 
the phase shift, are measured using a minimum error proba- 
bility quantum measurement. 



where Ij is the identity transformation on the idler 
modes. The set of complex numbers {e zm6M \ to € Zm} 
is a cyclic group of order M under multiplication and 
consequently, so is the set of M unitary operators {Ij <X> 
U m | to € Zm}- Assuming equal a priori probabilities for 
the M unitaries, the output states of Eq. ^ satisfy the 
symmetric set condition of ref. [20j defined by 



U>m) = V m \lP), toGZm, 

v M = i, 



(4) 
(5) 



for some unitary operator V and some seed state [24] . 
The correspondence to our problem is obtained by taking 
the seed state in Q to be the probe state \ip)is and 
the generating unitary operator V of eqs. Q and ^ to 
be 



V = Ir 



J0 M N 



CrU) 



(6) 



with Ui given by eq. 0. 

The discrimination strategy of Fig. 2 may be called an 
entanglement-assisted parallel strategy in analogy with 
the terminology of ref. [35], and corresponds to Fig. 2d 
of [25] augmented with idler modes. This is clearly not 
the most general strategy. For example, we may consider 
sequential strategies (see Fig. 3 of |25|V It is easy to show 
that the simple sequential strategy depicted in Fig. 3a 
of [35] cannot help, as follows. Successive application 
of two phase shifts of mBu rad results in a phase shift 
2m0M rad which is also in the set {uiOm I m € Z«} due 
to the group property of Zj\/. Thus, at best, the set of 
output states after two applications of the phase shift is 
a permutation of the set of states after one application of 
the phase shift, and in general, the set of output states is 
even less distinguishable since distinct phase shifts may 
result in the same state after multiple applications even if 
they do not after a single application. That said, we have 
not ruled out the efficacy of more complicated strategies, 



such as the sequential strategy depicted in Fig. 36 of [25J 
or strategies that adaptively select the input states of 
later signal modes conditioned on measurement results 
from earlier modes (see, e.g., 26 ). Optimization over all 
such strategies appears to be an involved task and will 
not be considered in this paper. 

It is worthwhile to mention a few more features of our 
problem setup. First, our model of the phase discrimina- 
tion problem, as given by Eq. ([3 ) , assumes the presence 
of a phase reference in that we have the ability to pre- 
pare pure probe states and not just phase-averaged mixed 
states. Such a phase reference may be physically imple- 
mented, for example, by a separate strong coherent-state 
beam. Second, though given a phase reference, we may 
still prepare arbitrary mixed-state probes that are not 
represented in Eq. Q but must be considered in a fully 
general optimization. Nevertheless, it will be convenient 
to first address the pure-state probe case of Eq. ^ in 
detail. The mixed state case will be subsequently ad- 
dressed in Section III.D to extend the optimization to 
the full state space. 



B. Probe state design 

Accordingly, we are now interested in minimizing the 
error probability over a set of allowed pure probe states 
\ip)zs- This logically entails two successive minimiza- 
tions. For a chosen probe state \tp)is, the minimum aver- 
age error probability achievable in the sense of Helstrom 
[TO] is given by 



1 



Af-l 

— max > tr 
M {E m } ^ n 

|_ TO J m — (J 



III ■n 



E„ 



(7) 



where {E m }^~Q is a set of positive semidefinite operators 
constituting a positive-operator- valued measure (POVM) 
[19] that represents any quantum measurement process 
that discriminates the M possibilities. The optimization 
over all POVM's in eq. ^ corresponds to the choice of 
the optimum measurement process for a given probe state 
\i/>)iS" We are interested in minimizing ^ further over 
a set of 'allowed' probe states as detailed below. 

Intuitively, it is clear that, if no further restrictions are 
imposed, one may be able to achieve arbitrarily small er- 
ror probability by choosing a probe state with sufficiently 
high energy. For example, we may use a single-mode co- 
herent state probe \y/N$) of average photon number Ng, 
leading to the output states {\VNse imf>M )}^ . As N s 
is increased, the output states become more and more or- 
thogonal and the error probability decreases. In the limit 
Ns — > oo, we have P e — > 0. Thus, further constraints are 
required to make the optimization problem meaningful. 
In the communications literature, an energy constraint is 
typically imposed in addition to a limit on the number 
of modes J, i.e., the bandwidth. In the present context, 
an energy constraint is particularly important because it 
is practically hard to prepare novel quantum states with 
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a large average photon number. One can also imagine 
scenarios, e.g., that of probing a sensitive biological sam- 
ple, where a signal energy constraint must be imposed to 
avoid damaging the sample during probing. In view of 
the above considerations, we constrain the average total 
photon number in the signal modes 



(N s ) = 




(8) 



Ns being a given number. The case of a constraint on 
the average photon number of the signal plus idler modes 
is considered in Section III.C. For brevity, we will simply 
write 'signal energy' for the average total photon number 
in the signal modes given by the left-hand side of Eq. (|8j . 
Using eq. ([2]), we calculate the signal energy 



Tlx 



nj) Ck,n 



E (Vi + ---+?V)I>k,n| 2 ^ 

^ {n x H h nj)p n 



(9) 

(10) 
(11) 
(12) 



where p n is the probability that the total photon num- 
ber n\ + • • • + nj in the signal modes is n. We denote 
the signal photon probability distribution by the infinite- 
dimensional vector p = (foiPii ■ ••)■ 

In the sequel, another discrete probability distribution 
derived from the probe state plays a major role. It de- 
pends on both the probe state (through p) and on M, 
as follows. For v G Z M , let p = (p , . . . , p„, . . . , p M -i) be 
defined component-wise as 



n : n= v (mod M) 



In other words, p is the probability distribution induced 
by p on the modulo-M congruence classes of the total 
signal photon number. 



III. SYMMETRIC PHASE DISCRIMINATION: 
PROBLEM SOLUTION 

A. Pure probe states 

It will be revealing to approach the problem set up in 
Section II in stages. We will comment as we go along on 
the implications of our results and their connections to 
the literature. As in Section II, we continue to assume 
a pure probe state and address the use of mixed probe 
states in Section III.D. 



We first present the basic result that, without invoking 
a signal energy constraint or specifying J and J', we can 
partition the probe state space into equivalence classes 
of states having the same error probability. 

Theorem 1. Pure probe states with the same p have the 
same performance in the discrimination of M symmetric 
phases. This statement encompasses probes with differing 
J and/or J' . 



Proof. For an arbitrary probe state \ip)is written in 
the form of Eq. (|2|), the corresponding output states 

m )ir} 



\i>m)iR = J2 c *>« eim<>U{ni ~ 

k,n 



• +nj) |k)/|n) fl . 



(14) 



Consider the MxM matrix G (the Gram matrix) whose 
elements are all the mutual inner products between the 
{Hm)iR}, i-e-, 



Gr. 



■= m(i>m\ip' 



m'lIR- 



(15) 



The minimum error probability ^ in discriminating the 
symmetric set of pure states {\ip m } iR}m=o ^ s a function 
of the elements of G alone (37J . We compute the general 
element of G to be 



m' IB. 

= 5Z l C k,n 



2 e -i0M(m— m')(»iH hnj) 



k,n 



E„ _-i0M(m-m')(niH h«j) 
Pn e 



n 

oo 

= X! Pn e ~ 

M-l 



iO m {m—m )n 



-id m (m— m )y 



(16) 
(17) 

(18) 
(19) 
(20) 



(13) The equality (20) follows because, for any m and m', the 



exponential factor is periodic in n with period M. We 
have thus shown that the Gram matrix, and thence the 
error performance, is a function of just the M compo- 
nents of p, and that this is true irrespective of the values 
of J and J'. □ 

The result of Theorem 1 is interesting for a number of 
reasons. First, it clusters probe states into classes with 
the same error performance based on the easily computed 
characteristic p. Second, since one can always prepare a 
signal-only (J' = 0) probe with a given p, the ancillary 
idler modes shown in Fig. 2 do not improve performance. 
This is unlike the typical situation in which ancillary en- 
tanglement in the probe helps in distinguishing M uni- 
tary transformations |28j . Third, since any given p can be 
realized using a single- mode signal state (i.e., with J = 1 
as in Theorem 2 below), no performance gain accrues 
from using multiple signal modes. This can again be con- 
trasted with the situation of distinguishing two general 
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finite-dimensional unitarics, for which multiple applica- 
tions of the unitaries can result in error-free discrimina- 
tion [29]. We mention that these latter two implications 
of Theorem 1 also follow from a general lossless image 
sensing result of ref. [3D] (see section on lossless imaging 
therein). Finally, the freedom of probe state choice al- 
lowed by Theorem 1 will turn out (in Section IV) to be 
crucial to a practical implementation of the M = 2 case. 

Our next result implies that, when the signal energy 
is constrained as in Q to a maximum of Ns, the most 
efficient way to use the energy is to use a probe state 
with p supported on just its first M components. It 
is also shown that, beyond a threshold signal energy of 
(M — l)/2, discrimination with zero error is possible. 

Theorem 2. (a) For N$ < (M — l)/2, a single-mode 

probe state of the form \tp)s — Ylu=o W)s with 
Pj, > achieves the minimum error probability. 

(b) For Ns > (M—l)/2, the uniform superposition state 

\fp)s = (l°)s H \-\M - l) s ) achieves perfect 

discrimination. 

Proof, (a) Presented with any probe state \ip)is with as- 
sociated p and p, we can construct the single-mode probe 



M-l 

E 

v=0 



'Pv WIS- 



(21) 



This state has the same p, so by Theorem 1 it has 
the same performance as the original one. Moreover, 
because the probabilities p n that \ip)is associates with 
n > M all contribute to photon numbers less than M 
in \ip)s, the total signal energy in (21), as given by 



Eq. (12 1, can only be equal to or lower than that of 



the original probe state. Thus, (21) provides the same 



performance at equal or lesser signal energy than any 
state with the same p. 



(b) If N s > {M - l)/2, consider the probe state 



M-l 



\1>h 



Vm 



Eh 



(22) 



which has energy (M — l)/2 and is therefore allowed by 
the energy constraint. From (20), we see that G mm ' = 
<5m,m' so that the output states are mutually orthogonal 
and the error probability is zero. □ 

The conclusion of Theorem 2(6) that discrimination 
with zero error is possible whenever N$ > (M — l)/2 is 
remarkable. In a communications framework, it implies 
that if a signal energy of at least (M — l)/2 is available, 
we can communicate one of M messages without error 
using phase modulation of the state ( 22 ) . This conclusion 



was noted in [5], although this, or any other, single-mode 
state cannot achieve error-free communication under the 
restricted class of measurements allowed in [5]. In [5J, 
an alternative scheme using two signal modes suffering 



conjugate phase shifts of 6 m and —0 m respectively was 
proposed that achieves the same end. 

Recently, it was shown in |11) that using the probe 
state 



)ecs = Ty= (\a)i\a) s 



\-a) z \-a) s ), (23) 



for reading a binary phase-encoded memory (so that the 
output states correspond to Eq. ([3| with M = 2) results 
in zero-error discrimination. Here, | ± a)s and | ± a)i 
are coherent states (with a/0 but otherwise arbitrary) 
and the normalization factor Af = ^2(1 - e- 4 H 2 ). The 
state ( |23[ ) is an example of an entangled coherent state 

(ecs) TS]. 

The existence of several states allowing zero-error dis- 
crimination is in itself not surprising if we apply Theorem 
1 to the state (22), because according to that theorem, 
any state with uniform p = (1/M, . . . , 1 /M) must provide 
zero-error discrimination and we can clearly write down 
an infinite number of states with uniform p. A more inter- 
esting question is whether there exist states with nonuni- 
form p, or with signal energy less than (M — l)/2, that 
also allow zero-error discrimination. Theorem 3(6) below 
implies that the answer to both questions is negative, so 
that a signal energy of at least (Af — l)/2 is a necessary 
condition for zero-error discrimination. In this connec- 



tion, it may be verified that the ECS of Eq. (23) has 
p = (1/2, 1/2) and signal energy \a\ 2 / (2tanh|a| ; 7> 1/2 
for \a\ > [3J]. 

The next result gives the form of the optimal probe 
state for a signal energy constraint Ns < (M — l)/2 and 
also shows that the only probe states achieving zero-error 
discrimination have uniform p, and therefore have signal 
energy greater than or equal to (M — l)/2. 

Theorem 3. (a) Among all probe states satisfying 
(Ns) < Ns < (M — l)/2, the minimum error proba- 
bility is achieved by the state 



M-l 



i^)o Pt = vp<> \ v )s, 



with p given by 



K (A + vBf 



(24) 



(25) 



where A, B are positive constants chosen to satisfy 
the constraints 



M-l 



M-l 



(26) 



!/=0 



v=0 



(b) Any probe state achieving zero-error discrimination 
must have p = (1/M, . . . , 1/M) and signal energy 
greater than or equal to (M — l)/2. 
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Proof, (a) By Theorem 2(a), it suffices to consider a 
single- mode probe state of the form |2l| ). As shown in 
Section II, the output states {\ipm)iR.}o 1 form a sym- 
metric set in the sense of having equal a priori proba- 
bilities and satisfying Eqs. Q and It was shown in 
Refs. [TBI HO] that the Square- Root Measurement is the 
minimum error probability measurement for any sym- 
metric pure-state set. An explicit formula exists for this 
minimum error probability [7j HH [20] . We use the fol- 
lowing expression from ref. [7]:- 



P, 




A 'P \ ^ 

i rn— 



(27) 



where A = (Ao, . . . , Am-i) is the vector of eigenvalues of 
the Gram matrix G [33 . The ordered vector of eigenval- 
ues A is specified by the formula (cf. Eq. (42) of [7J) 

M-l 

\n' = J2 lR^rn)lR ' fT*"'™*" (28) 

m=0 
M-l 

= Yl Go m e- im ' meM . (29) 



m=0 



Note that the eigenvalues only depend on the first row of 
the Gram matrix - indeed, the symmetric set property 
of the {\ip m }iR} guarantees that the remaining rows are 
obtained by cyclically shifting the first. While the order- 
ing of eigenvalues in A enables writing down the compact 
formula ( 28 I and has a physical interpretation that will 



appear, note that the error probability itself does not 
depend on the ordering. 
We may rewrite (291 as 



A = T [G ] , 



(30) 



where T is the Discrete Fourier Transform (DFT) on Zm 
and Go = {Go m } is the first row of the Gram matrix. On 
the other hand, ( 20 ) implies that 



G = M • J- 1 [p] , (31) 
where J 7-1 is the inverse DFT. We therefore have 



A = Mp, 

which gives a physical interpretation for A. 



(32) 



Minimizing P e for states of the form (21) with signal 



energy (Ns) = Ns is then equivalent to maximizing the 
concave function y/p^ over the convex set of p's 

for which the probability normalization constraint 



M-l 

E 

u=0 



and the signal energy constraint 



M-l 



vi? v = N s 



(33) 



(34) 



are satisfied. Following the usual Lagrange multiplier 
method, we define 

F(po,...,p M _ 1 ,A, B) 

M-l /M-l \ /M-l \ 

= y,vk-a x>„-i )- B E^-^ - 

u=0 \ u=Q ) \ i/=0 / 

(35) 

and solve the equation VF = 0. The solution is 



1 

(A + vBf 



j/ <G Z 



M ■ 



where A, B are chosen such that X^^Iq 1 = ^ anc ^ 



(36) 



^*^ ll/ Piy — Ns- The point p defined by (36) is an 



interior point of the domain of optimization and a local 
maximum by the gradient condition. Since the function 
being maximized is concave, it is also a global maximum 
on the domain 34J. Thus, the state (36) achieves the 



minimum error probability among probe states with 
energy exactly Ns- Lemma 1, below, establishes that the 
state ( 36 ) is also optimal under the inequality constraint 



(b) From (27), it is 
v M-i ,— — 



evident that P e — if only 
It is easy to verify that the 



maximum value of the quantity J"^,_ y/f^ under just 

the constraint X^io 1 = 1 is \[M and is achieved only 
if p„ = 1/M, < v < M — 1. Thus, the states providing 
zero-error discrimination are exactly those with uniform 
p. Such a state has signal energy at least (M — l)/2. □ 

The following result may be expected on physical 
grounds (although it does not hold for Ns > (M — l)/2), 
but we need a proof because this fact is used to complete 
the proof of Theorem 2(a). 

Lemma 1. The optimum single-mode probe state of the 
form ( |21[ ) under the inequality constraint (Ns) < Ns < 
(M — l)/2 is the same as the optimum state under the 
equality constraint (Ns) = Ns- 

Proof. Consider the maximization of J^^L"^ 1 under 

the probability constraint Y^=o = ^ s snown m 
the proof of Theorem 3(6), the maximum is achieved at 
p* = (1/M, . . . , 1/M). Denote the p of an optimal state 
under the inequality constraint by p^. Denote its signal 
energy by N s < Ns- Since the set of all p is convex, 
the line segment L joining p^ to p* in that set consists of 
allowed p's. Since the signal energy is a linear function 
of p, L contains states of signal energy ranging from N s 
to (M — l)/2. Further, since the function J^iLo 1 y/Pv is a 
concave function of p whose maximum is attained at p*, 
the function must be nondecreasing as we move along L 
from p„ to p* [34]. In particular, we can find a state on L 
with signal energy Ns and equal or better performance 
than that obtainable from p t . Consequently, there is an 
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optimal state under the inequality constraint with signal 
energy exactly N$- One such state must be that given 

by m. □ 



The optimum probe state for binary phase discrimina- 
tion is particularly straightforward. 



Corollary 1 (Binary case). For M 
probe state for N$ < 1/2 is 



= y/l-N S \0) S + 



2, the optimum 



(37) 



Proof. This follows immediately by solving for A and B 
in Theorem 3. □ 

However, note that our proposed implementation of 
the binary case for achieving the minimum error proba- 
bility 



Pr 



1/2 - y/N s (l - N s ) 



(38) 



in Section IV uses a different probe state (with the same 
signal energy and performance) for practical reasons. 

In the general (M > 2) case, it appears that closed- 
form solutions for A and B appearing in the expression 
( 36 1 for the optimal state cannot be obtained, so recourse 



to numerical evaluation becomes necessary. Simulations 
of the resulting performance are presented in Section 
III.E. 



B. Optimal measurement 

Let us briefly discuss the quantum measurement, as 
determined by the corresponding POVM, that optimally 
distinguishes the output states {\i/) m )iR}m=o ■ Following 
refs. [3 QS ED], the optimum POVM, the SRM, is a rank- 
one measurement with elements Il m = \Xm) IR{Xm\-> m € 
%m (\Xm)iR ma y have norm less than one) with 



'M-\ 



-1/2 



IR, 



(39) 



, n=0 



where the operator in parentheses (and its inverse) is 
defined on just the span of {\ip m )iR}m=o • For a single- 



mode probe of the form 
orem 2, we have 



fa\n) s of The- 



\Xm)R 



1 



E • 



irnnO m 



n) r. 



(40) 



Note that the optimum measurement is the same for any 
two single-mode probe states \i[>)s, I^Os with p,p' having 
the same support. The optimum state of ( 24 ) has no zero 



coefficients so that the optimum measurement elements 
are 



\Xm)R = 



1 



'M 



M-l 

E« 

n=0 



n)i 



(41) 



Since these vectors form an orthonormal set, the mea- 
surement is a projective (von Neumann) measurement. 
Indeed, these measurement vectors coincide with the 
eigenstates of the unitary Pegg-Barnett phase oper- 
ator 35j on the truncated Hilbert space Hm-1 — 
span {|0)fl,...,|M -!)«}. 



C. Combined energy constraint 

We have so far assumed that a constraint is placed on 
the energy of just the signal modes. In some situations, 
it may make sense to constrain - as a measure of all the 
resources involved in state preparation - the average total 
energy in the signal and idler modes combined without 
restricting either individually. In other words, we impose 
the constraint 



N) := N 




Ci') 



(42) 



for a given number N, where {Ng} and {N^/ ■*} are the 
modal signal and idler photon number operators respec- 
tively, and ask for a state satisfying this constraint that 
minimizes the error probability. This new problem has 
the same solution that wc found before. 

Theorem 4. Among pure-state probes, the minimum er- 
ror probability achievable under a combined energy con- 
straint N is identical to the minimum error probability 
achievable under a signal energy constraint N . 

Proof. The combined energy constraint of N is clearly 
more restrictive than a signal energy constraint of N. We 
showed in Section III. A that the optimal pure-state probe 
for a signal energy constraint of N is a signal-only state of 
energy N. Since this state also has combined energy N, 
it remains the optimal state under the combined energy 
constraint. □ 



D. Mixed probe states 

In our work so far, we have assumed that the probe 
state \ip)is was pure. Of course, we may also use a mixed 
probe state pis resulting in the mixed output states 

Prn = (jj ® U m ) PIS (jl <8> (43) 

for U m given by Eq. 0. We now show that allowing for 
mixed probes does not lead to improved performance. 
We actually prove a stronger result. 

Theorem 5. Let pis be a mixed state with ensemble de- 
composition pis — Kjlipj) isfyjl and with signal en- 
ergy tv(pisNs) < N or with combined energy tr (p is N) < 



N. A transmitter preparing the ensemble {\ipj)is} with 
probabilities {ftj} and a receiver making optimal mea- 
surements conditioned on knowledge of j cannot beat the 
performance of the optimal pure-state probe under either 
energy constraint. 

Proof. Let P e ['] denote the minimum error probability 
attainable on using the argument as probe state. We 
have the chain of inequalities 



P, 



[Pis. 



3 



>Pe 
> Pp 



/,opt\ 



(44) 
(45) 

(46) 
(47) 



In ( 44 ) , the right-hand side represents the optimum per- 
formance under the conditions of the theorem statement 
and the inequality holds because the performance given 
by the left-hand side is obtained when knowledge of j 
is ignored by the receiver [36]. \ip*)s is the state of the 
form (21) with the same p as \ijjj)is, denoted p,-, and 



(451 holds by Theorem 1. In (H6j), \ip) s is the state of 



the form (21) with p = Y). 7T,Pj , and the inequality is 



true because P e of (27) is a sum of convex functions of 
p , and hence convex itself. None of the state trans- 
formations above has increased the signal (or combined) 
energy from that of pis, so that \ip)g is a pure state with 
energy bounded by N. It cannot beat the optimum pure 
state |i/ ,opt )s with energy N. □ 

With Theorem 5 in hand, we may conclude that the 
state of Theorem 3 is in fact the pure or mixed state of 
energy at most N$ that has the lowest error probability. 
This result has the following implication. Recall that a 
classical state of the signal and idler modes is a density 
operator pjg expressible as a mixture of coherent states 
in the form 



pis 



P{c t ,f3)\a) I \f3) SI (a\s(f3\d 2 ad 2 /3 (48) 



where \a)j (|/3)s) is a multimode idler (signal) coher- 
ent state and P(ct,(3) > is a probability density [55] , 
Such states are readily prepared from laser outputs using 
beam splitters and classical random numbers, and stan- 
dard measurements made on them are quantitatively de- 
scribable using semiclassical photodetection theory [3D]. 
The optimal state (24) is a nonclassical state (coherent 



states are the only pure classical states), and being opti- 
mal, it performs better than the coherent state of energy 
Ns (see Fig. 3). However, from Theorem 3 alone, we 
cannot conclude that it performs better than an arbi- 
trary classical state of the form ( 48 ) . With the addition 
of Theorem 4, we can draw the conclusion that the non- 
classical state ( 24 ) outperforms all classical states. Note 



that our argument does not imply that the coherent state 
of energy Ns is the optimal classical state with energy 
N s . 
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FIG. 3: The error probability P e of the optimum probe given 
by Eq. (24 1 (solid) and the coherent-state probe (dashed) as 
a function of Ns for M = 8, 16, 32, and 64. Curves for larger 
M are shown lighter. 



E. Numerical results 

In this section, we numerically compare the perfor- 
mance of the optimal state of Theorem 3 with that of 
some standard states in quantum optics. We first con- 
sider some signal-only states and then a couple of two- 
mode entangled states. With the exception of the op- 
timum state (24), the performance curves are obtained 



by computing the first row Go of the Gram matrix, for 
which analytical formulas are given below for each family 
of states. We then compute the eigenvalue vector A (or 
equivalently p) using an FFT routine (cf. Eq. (30)), and 
finally the error probability P e via Eq. (27). 



1. Optimal state 

For each value of the probe signal energy Ns, the p of 
the optimal state is first obtained by numerically solving 
for A and B appearing in (36). We then compute the 



error probability via (27) and plot it against Ns in Fig. 3 
for a few values of M. We find that as Ns — > (M — 
l)/2, A — > y/M and B — >_0_ L so that the numerically 



computed optimal state of (36 1 approaches the uniform 
superposition state of Theorem 2(6). The approach to 
zero error in the same limit is clearly visible in Fig. 3. For 
comparison, the coherent state performance (See section 
III.E.2) is also plotted alongside. 



2. Squeezed state and coherent state 



The single-mode squeezed states [41] are a well-known 
class of states. Let as = n*b — vb\ where |/i| 2 — \v\ 2 = 1 
and & is in a coherent state |a) satisfying b\a) = a\a), 
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FIG. 4: (Color online) Minimum error probability as a func- 
tion of signal energy Ns for the states of Section III.E for the 
binary case M = 2. The squeezed state has squeeze parame- 
ters fi, v optimized for each Ns, and the two-mode squeezed 
vacuum has J = 1. 



a > 0. Then, the signal mode as is in the squeezed state 
(called two-photon coherent state (TCS) in [5T]) \a; fi, v). 
In the following, we assume /i, v are real and /i > 0. If 
v < 0, this transformation corresponds to squeezing in 
the imaginary (phase) quadrature while stretching the 
real quadrature. The average energy of this state may be 
calculated to be 



a\a s 



(/i — v) a 2 + v 2 



(49) 



The action of the m-th phase rn takes as to — 
a$e m in the Heisenberg picture so that 



lie 



ve 



St. 



(50) 



Thus, the Schrodinger-picture state at the end of these 
transformations is \a; fie~ l6m , ve l9m )R. 

We may now use Eq. (3.25) of ref. [H] to write down 
the Gram matrix elements Go m :- 



/e l9m - [iv(e 2ie " 



x exp 



-1 



1 



(51) 



The coherent state \a)s is identical to the TCS 
\a; 1,0) r and its Gram matrix elements can be obtained 
from eq. (51 1. 



3. Two-mode squeezed vacuum state 

The two-mode squeezed vacuum (TMSV) state with 
signal energy Ns is the two-mode state [55]:- 



\i>™* v (Ns)} IS = 



1 



T Sv^rf^ |n)/l?1)s - 

(52) 



We may also consider using J copies of a TMSV state as 
probe. In order to keep the same signal energy Ns, we 
must use J copies of \tp TMSV (Ns/ J))is- We may directly 
compute the first row of the Gram matrix of the output 
states as 



Con 



1 



^(1 



G Z 



M ■ 



(53) 



4- Pair-coherent state 

The pair-coherent states (PCS) are a family of two- 
mode states parametrized by £ G C and a non-negative 
integer q [35]. We will consider the case q = corre- 
sponding to equal energy in the signal and idler modes. 
Such a PCS has the form 



1 



(54) 



where Jq(") is the modified Bessel function of first kind 
and order zero. The signal energy Ng is related to £ via 



N s = 



\c\mc\) 

W\C\) : 



(55) 



where Ji(-) is the modified Bessel function of first kind 
and order one. Since the phase of C does not affect the 
performance, we will assume C to be real and positive. 
The Gram matrix elements may be computed to be 



Gom — 



i8m/2 



m <G Z 



(56) 



5. Performance curves 

Numerical results for M = 2 and M = 8 are shown in 
Figs. 4 and 5 respectively. For the special case of binary 
discrimination, the minimum error probability is 



(57) 



where a — ir(iPo\iPi)ir- For each of the probe states 
considered above, we have 



CCS 
CTSS 

f TMSV 
CPCS 



e -2N s 

e -2N s {N s + l) 
1 

(i + 2N s /jy 

Jo(2Q 
4> (2C) ' 



(58) 
(59) 

(60) 
(61) 



For the squeezed-state (SS) probe, an optimal squeezing 
has been assumed (see below) . For multiple copies of two- 
mode squeezed vacuum, note that otmsv i s decreasing 
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FIG. 5: (Color online) Minimum error probability as a func- 
tion of signal energy Ns for the states of Section III.E for 
M = 8. The squeezed state has / = v 2 /Ns optimized for 
each Ns and the two-mode squeezed vacuum uses J = 15 
signal modes. 



in J and cttmsv - ► as J — > oo. For certain values 
of Ns, the PCS performs best among all but the optimal 
state. 

For M — 8, on the other hand, the PCS performs the 
worst and is consistently beaten by the coherent state. 
The squeezed state with an optimized amount of squeez- 
ing performs the best among the suboptimal states and 
is closely matched by the two-mode squeezed vacuum, 
both of which consistently beat the coherent state. 

For the squeezed-state probe, we can further discuss 
the optimal squeeze parameters [A, v for a given energy 
Ns- For symmetric phase discrimination, we might as- 
sume that phase-squeezing, /i > 0, v < 0, is optimal. The 
following is what we find. 



• M = 2. The optimal v = N s /^1 + 2N S is posi- 
tive. The phase space representation is shown in 
Fig. 6(a) for this case. Note that for a given Ns, 
the mean (clr) is independent of the sign of v. 

• M = 3. Numerically, we find the optimal v is pos- 
itive. 

• M = 4. Numerically, no squeezing v = (coherent 
state) appears to be optimal. 

• M > 5. Numerically, we find the optimal v is neg- 
ative. For large M, phase-squeezing can be seen 
to reduce the overlap between neighboring states 
(Fig. 9(6)). 



IV. IMPLEMENTATION OF CONDITIONALLY 
OPTIMAL BINARY PHASE SHIFT KEYING 

In this section, we show that the performance of the 
optimal state for the M = 2 case, the state of Corollary 



m = 







m — 1 


m = 



(a) v > 



(b) v < 



FIG. 6: Phase space representation of M = 2 squeezed-state 
probe: (a) Amplitude-squeezed, (b) Phase-squeezed. 



(a) v > 



(b) v < 



FIG. 7: Phase space representation of M = 3 squeezed-state 
probe: (a) Amplitude-squeezed, (b) Phase-squeezed. 



(a) v > 



(b) v < 



FIG. 8: Phase space representation of M = 4 squeezed-state 
probe: (a) Amplitude-squeezed, (b) Phase-squeezed. 





(a) v > 



(b) v < 



FIG. 9: Phase space representation of M = 8 squeezed-state 
probe: (a) Amplitude-squeezed, (b) Phase-squeezed. 



11 



1, can be readily obtained in the laboratory with current 
technology. Furthermore, the inclusion of transmission 
losses and sub-unity detection efficiencies leads to occa- 
sional inconclusive outcomes (or erasures) but leaves un- 
changed the error performance conditioned on no erasure. 
These results are applicable both to long-distance com- 
munication based on binary phase shift keying (BPSK) 
and to (short-distance) phase sensing or reading of a 
phase-encoded memory of the type described in ref. |llj . 
The optimal probe state of Eq. (36), and thus also 



the M — 2 optimal state of Eq. (37 1 (we are assum- 
ing Ng < 1/2 for the latter statejf is hard to pre- 
pare in a deterministic fashion. However, several tech- 
niques exist [43-45 to prepare such Fock-state superpo- 
sitions in a conditional manner, of which some have been 
demonstrated experimentally [35] • The state (37) can 
be thought of as a qubit state in a "single-rail" encod- 
ing [46j . The optimal measurement on the output states 
following the phase shift is, via Eq. (|4T|), a projective 



measurement onto the basis {(|0)s ± |l)s) /v2}< Such a 
measurement would require implementing a unitary tak- 
ing the {(|0)<? ± /V2} basis to the {|0) <?, |1)<?} basis, 
followed by photodetection. However, it is well-known 
(see ref. [46] and references therein) that it is impossible 
to effect the required unitary transformation determinis- 
tically with linear optics, leading to further inefficiencies 
in an implementation involving the state (37). 



By an appeal to Theorem 1, both the state preparation 
and measurement issues can be circumvented. To do so, 



we use instead of (37) the probe state 



Wis = Vl-Ns 



(62) 



where the S and / modes may be spatially separated (as 
in Fig. 10) or may correspond to orthogonal polarization 
degrees of freedom of the same spatiotemporal mode. In 
effect, we have switched to a dual-rail qubit encoding [46] 
in going from (37) to (62). Note that the states (37) and 



( 62 1 have the same p and p and so have the same signal 



energy and error performance. 

As shown in Fig. 4, the state (62) can be prepared 
by directing a single photon to a Ng ■ 1 — Ng beam 
splitter. Further, Eq. (39) dictates that the optimum 



POVM measures the basis {(|01) JS ± |10) /s ) /y/2}. This 
may be accomplished by a 50:50 beam splitter followed by 
single-photon detection using two detectors at the output 
ports of the beam splitter. Hypothesis m E {0, 1} is 
declared if detector D m clicks [47] . 

Let us first consider ideal operation and ignore the 
small beam splitters in Fig. 4 representing transmission 
and detection losses. It can be verified that the two pos- 
sible states in the output modes i?o and R\ of the second 
beam splitter just before the detection are given by 

|^0(l))ilofli = A +(-)l 10 )flo«i + A -(+)|0!).RoA.i> ( 63 ) 
where 

VT^Ng-±ijN~s 



A 



+(-) 



V2 



(64) 



One of the detectors always clicks and the error proba- 
bility evaluates to 



P. 



Ng), 



(65) 



which agrees with Eq. ( 38 1 



Let us now consider the performance of the setup of 
Fig. 4 in a realistic setting that includes transmission 
loss and non-unity quantum efficiency of the detectors 
as shown. We assume that the detectors have negligible 
dark count rates. Transmission loss may be the dominant 
factor in a long-distance communication system but may 
be negligible in the reading of a memory. For simplicity, 
we assume rjg = rji = t]t < 1, which is a realistic as- 
sumption in the communication context, although it can 
be relaxed without altering our conclusions much. We 
also assume r)p = rj^ = rjrj and let i] = r/Ti]D- 

By following the evolution of the probe state through 
the system while preserving unitarity by adding vacuum- 
state input modes at each of the small beam splitters, we 
can write the state of the entire system just before the 
detectors as 



IV'O(I)) =VV (A + (-)|10)r dHi + A_ (+) |01) jRoi?1 ) 



(66) 

x lOOOO)^^; + 10)^0)^,0^5^;, 

(67) 

where S, I, D' and D[ denote the leakage modes and 
|0Ioss)7'S'Z5 ( ^li; denotes an (un- normalized) state of those 
modes whose squared norm is (1 — rf) and is an eigenstate 
of the total photon number in the leakage modes with 
eigenvalue one. This second term corresponds to the case 
when none of the detectors click, i.e., an erasure occurs. 
The probability of erasure P e is then independent of m 
and equals 



Pe = 1 - V- 



(68) 



On the other hand, when one of the detectors does 
click (which happens with probability ry), the error 
probability of the ensuing decision is exactly the same 
as before, that given by (38). Note the erasure prob- 



ability is independent of Ng and the error probability 
conditioned on no erasure is independent of r\. This 
latter probability is identical to the error probability 
obtainable from the optimum state of Corollary 1. In 
particular, if Ng = 1/2, there is no error whenever there 
is no erasure. 



V. CONCLUDING REMARKS 

In this paper, we have set up an M-ary phase dis- 
crimination problem that naturally models phase-based 
communication, phase sensing, and quantum reading of 
a phase-based digital memory. Allowing for a general 
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FIG. 10: Setup for realizing the optimal BPSK performance with signal energy Ns- The signal (S) and idler (/) modes are 
taken to be spatially separated modes with the same polarization and transverse spatial pattern. The Ns : 1 — Ns beam splitter 
synthesizes the optimal probe state from a single-photon state. The shaded box containing the phase unitary transformation U m 
on the signal mode represents the sender's modulator and the transmission medium in a communication scenario or the phase- 
encoded memory element in a quantum reading scenario. The 50:50 beam splitter performs a rotation of the ideal measurement 
basis into the single-photon states of the output modes Ro and Ri . The small beam splitters with the indicated transmittances 
model the transmission losses suffered by the signal mode (fjs), the idler mode (??/), and the quantum efficiencies of the detectors 
Do and D\. Hypothesis m is declared when detector D m clicks. If neither detector clicks, an erasure has occurred. 



entanglement-assisted probing strategy, we have charac- 
terized the equivalence classes of probe states with the 
same performance. We have found the exact form of the 
optimizing probe state as a function of the energy Ns 
and characterized the probes that allow zero-error dis- 
crimination. From a theoretical point of view, we have 
thus completely solved a constrained bosonic channel dis- 
crimination problem, a class of problems for which exact 
solutions are rare [14] . We have studied the error perfor- 
mance of some standard states in quantum optics. For 
the M = 2 case important to reading of a memory, we 
have shown that the optimal performance can be read- 
ily obtained with current technology conditioned on no 
erasure due to system losses. 

From a more practical point of view, the analysis here 
is limited by not having included the effect of system 
losses in general. We note that including loss in our prob- 
lem model brings it into the general lossy image sensing 
framework considered in ref . [SO] so that the result of that 
paper on the form of the optimal probe can be used as a 
starting point for analysis. Nevertheless, the work of this 
paper remains essential to the subsequent analysis of the 
lossy system performance. 

Another serious practical problem is that of synthesiz- 



ing the optimal probe states in either the lossy or lossless 
cases as well as realizing the optimal POVMs on them. 
It may be hoped that the flexibility in state prepara- 
tion afforded by Theorem 1 can partially alleviate these 
problems, though it remains to be seen if optimal or near- 
optimal performance can be achieved in practice for some 
instances of the problems considered here. 
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